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Abstract 

Background: Mycobacterium avium subspecies paratuberculosis (MAP) whole cell vaccines have been widely used 
tools in the control of Johne's disease in animals despite being unable to provide complete protection. Current 
vaccine strains derive from stocks created many decades ago; however their genotypes, underlying mechanisms 
and relative degree of their attenuation are largely unknown. 

Results: Using mouse virulence studies we confirm that MAP vaccine strains 316 F, II and 2e have diverse but 
clearly attenuated survival and persistence characteristics compared with wild type strains. Using a pan genomic 
microarray we characterise the genomic variations in a panel of vaccine strains sourced from stocks spanning over 
40 years of maintenance. We describe multiple genomic variations specific for individual vaccine stocks in both 
deletion (26-32 Kbp) and tandem duplicated (1 1-40 Kbp) large variable genomic islands and insertion sequence 
copy numbers. We show individual differences suitable for diagnostic differentiation between vaccine and wild type 
genotypes and provide evidence for functionality of some of the deleted MAP-specific genes and their possible 
relation to attenuation. 

Conclusions: This study shows how culture environments have influenced MAP genome diversity resulting in large 
tandem genomic duplications, deletions and transposable element activity. In combination with classical selective 
systematic subculture this has led to fixation of specific MAP genomic alterations in some vaccine strain lineages 
which link the resulting attenuated phenotypes with deficiencies in high reactive oxygen species handling. 

Keywords: Mycobacterium avium subspecies paratuberculosis, Vaccine, Comparative genomics, Variable genomic 
island, Attenuation, Microarray 



Background 

Mycobacterium avium subspecies paratuberculosis 
(MAP) is a proven enteric pathogen with a wide host 
range that includes many domestic and wild animals [1]. 
It is the causal agent of Johne's disease (JD) in animals 
which is particularly common in countries with signifi- 
cant dairy industries leading to considerable economic 
losses [2]. MAP can infect, disseminate and persist in 
humans and has been suggested as a contributory factor 
in the development of Crohn's disease [3]. 
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MAP vaccines are a major tool used in the control of 
JD in animals and can be highly profitable [4] . They have 
advantages over herd management [5] and culling strat- 
egies [6] in being more cost efficient, easier to imple- 
ment on a wide scale and less reliant on diagnostic 
testing. It is clear however, that although able to prevent 
a majority of animals from reaching onset of clinical dis- 
ease, their current formulations provide incomplete pro- 
tection against infection and shedding [7-9], thus failing 
to eradicate the organism [10]. Most current whole cell 
vaccine preparations rely on subcultures of classic 
strains that were generated over 70 years ago [11] and 
some evidence suggests that, for killed preparations at 
least, more recently acquired local virulent strain types 
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may be more effective [12]. Previous experience with 
BCG has shown that frequent in vitro passage of strains in 
different laboratories led to significant alterations in gen- 
omic profiles and diversities in attenuation and immuno- 
genicity [13]. It is of importance therefore to derive 
accurate definitions of MAP vaccine genotypes to better 
standardize vaccine manufacture and understand the crit- 
ical mechanisms determining vaccine attenuations and 
protective efficacies. 

The distribution and worldwide use of MAP vaccines 
has continued since live 'attenuated' strains were selected 
in France (1924) and the UK (1940) using a method of 
sequential passage similar to that applied for the gener- 
ation of BCG [14]. The degree and mechanism under- 
lying their attenuation however is uncertain as virulence 
studies were not performed in any detail. Concerns in 
the 1980s regarding the use of live vaccine strains be- 
cause of low shelf life and spread to the environment 
promoted the use of killed vaccine formulations. These 
were based on various combinations of three MAP 
strains comprising strain 2e from the UK, strain II from 
Canada and 316 F. A combination of these were used as 
live preparations for Norwegian goats in the 1960s and 
1970s [15] and as a killed preparation delivered in min- 
eral oil to Icelandic sheep in the 1950s [16]. Live vaccine 
formulations of 316 F alone were used in the 1960s and 
70s in the UK [17] and Cyprus [18], 1980s in Hungary 
[19], 1990s in Germany [20] and Spain [21] and up until 
2002 in New Zealand [22]. Killed preparations of 316 F 
alone have been used extensively worldwide [23] and are 
still available for commercial use. These strains, due to 
the difficulty in retaining mycobacteria in frozen seed 
stocks, have been maintained through regular subculture 
on a variety of laboratory in-house media. It is unsurpris- 
ing therefore, that some reports suggest strain adaptation 
to growth in specialized media with loss of Mycobactin J 
dependence [24] and genome diversity [25] has occurred 
amongst some lineages. 

In this work we demonstrate attenuation and differential 
virulence of vaccine strains 2e, II and 316 F in a mouse 
model and use a full MAP genome microarray, supported 
by PCR and sequencing to investigate the genomic shifts 
of vaccine strains from a variety of lineages, including one 
recently resuscitated 316 F strain, originally lyophilised in 
1966. We describe large genomic regions with deletions 
and tandem duplications uniquely associated with each 
vaccine clade, demonstrate the functionality of some of 
these deleted genes and hypothesise as to their role in 
virulence attenuation. 

Results 

Comparative Genomic Hybridisation of vaccine strains 

MAPAC hybridisations comparing each vaccine strain 
against a MAPK10 reference control were made (in 



duplicate) and averaged values displayed as scatterplots 
(Figure la and Figure lb). Significant loss of signals 
in contiguous genes representative of large variable 
genomic island (vGI) deletions were identified in a 26.8 
Kbp region of 316FNOR1960 (vGI-19: MAP3714- 
MAP3735c; Table 1) and a 32.8 Kbp region in both 
IIUK2000 and 2eUK2000 (vGI-20: MAP1694-MAP1727; 
Table 2). Two fold increases in signals were also seen in 
contiguous genes within a 24.9 Kbp region of IIUK2000 
(vGI-21: MAP2705c-MAP2733c; Table 3), a 40.7 Kbp re- 
gion of 316 F-NLD1978 (vGI-22: MAP1750-MAP1789, 
Table 4) and a 11.0 Kbp 316FUK2000 (vGI-lb: 
MAP0096c-MAP0104; Table 5). 

Signal ratios from genes in multiple copies in the genome 
such as insertion elements are more difficult to interpret 
as changes in signal are related to copy number. Neverthe- 
less IS900 signal ratios were significantly reduced in 
IIUK2000, 2eUK2000, 316FNOR1960, 316v, 316FUK2000 
and elevated in 316FNLD1978 and 316FNLD2008. 
Increases in signals for ISi652-like elements were also seen 
in IIUK2000, 2eUK2000, 316FUK2000, 316FNLD1978 
and 316FNLD2008 and decreases for this element in 
316FNOR1960. IS256-like elements were more abundant 
in 2eUK2000 and also raised in 316v and 316FUK2000 but 
decreased in 316FNOR1960 relative to MAPK10. IS1311- 
like elements were increased in 316v and decreased in 
316FNOR1960. One of these elements (MAP0104) borders 
the previously described vGI-lb region which is bounded 
by short inverted repeat sequences and duplicated in MAP 
type III strains [26]. 

The tandem duplication vGI-21 was confirmed using 
primer set MAP2705.seq2 and MAP2733.R (Table 6) 
orientated to amplify only the end of one tandem repeat 
and the beginning of the other. This produced an 
amplicon of 651 bp only with vaccine strain IIUK2000 
and IIUK2001 (Additional File 1). Sequencing of this 
amplicon showed the duplication event to occur relative 
to the reference strain K10 genome (GenBank accession 
AE016958) at position 3066285 within the first copy of 
MAP2733c (truncated at aa251) followed by a unique re- 
gion of 16 bp (CCAGCTCGGCCACCCG) and then a 
second copy of MAP2704 (truncated at aal38) followed 
by the vGI-21 duplication thus comprising a tandem du- 
plication of 24,971 bp spanning 29 ORFs (MAP2705c- 
MAP2733c). PCR using primer pairs specific for this 
tandem duplication MAP2705.seq2 and MAP2733.R was 
negative for all other strains included in this study. 

vGI-22 was confirmed as a tandem duplication using 
primer set MAP1789.F and MAP1750.R (Table 6) which 
produced an amplicon of 2967 bp with vaccine strain 
316 F-NLD1978 only. Sequencing of this region showed 
the duplication event to comprise 40,744 bp spanning 41 
ORFs (MAP1749-MAP1789). The duplication site occurs 
at Genbank accession AE016958 position 1952589 within 
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Figure 1 Microarray scatterplots comparing genomes of test MAP vaccine strains against MAP K10 reference strain. 



the first copy of MAP1790 (truncated at aal73) followed 
by an overlap region of 3 bp (GGG) and then the second 
copy of MAP 1748c (truncated at aal43) followed by the 
vGI-22 duplication. PCR with primer pairs specific for 
this tandem duplication (MAP1789.F, MAP1750.R) was 
negative for all other strains included in this study. 

Several attempts to localise the vGI-lb duplication 
using outward facing primers around the region in a 
similar manner to vGI-21 and vGI-22 were unsuccessful 
and this region may not be present in 316UK2000 as a 
tandem duplication. qPCR was performed using both 
MAP0101 and MAP0103 specific primer pairs located 
inside vGI-lb and the relative copy number of each 
compared against a single copy genome target control 
represented by primer pairs to MAP2114c. Both 
MAP0101 and MAP0103 pairs showed a doubling of 
signal strengths relative to MAP2114c in strain 
316FUK2000 whilst all other strains included in this 
study showed signal consistent with single copies of 
both these genes (Table 7). 



\S900 insertion site analysis 

To determine which IS900 sites were absent relative to the 
K10 reference genome, PCR primers were designed to spe- 
cifically amplify each of the known 17 1S900 loci (Table 6). 
These were used to confirm the insertion of IS900 into each 
locus in the reference strain K10 and were also all positive 
in all 316 F strains and a caprine isolate CAM87. Both 
vaccine strains IIUK2000 and 2eUK2000 were missing 
IS900(MAP1722) whereas IS900(MAP1033) was also 
missing from vaccine strain 2eUK2000 but present in all 
other strains including vaccine strain IIUK2000. Compara- 
tive qPCR of IS 900 copy number relative to MAP2114c, 
demonstrated a range of IS900 copies in vaccine strains that 
corresponded to the trend in hybridisation signals observed 
in MAPAC scatterplots (Figure la & lb). The ratio of copy 
number however was surprisingly higher than predicted, 
with vaccine strains IIUK2000 and 2eUK2000 having only 
13 copies whilst MAPK10 and 316 F strains gave signals 
correspondent with 16-19 relative IS900 copy numbers 
(Table 7). 
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Table 1 Deleted region (vGI-19) in 316FNOR1960 


Gene 


Name 


Putative function 


Process 


MAP3714 


fadD2 


LuxE like Acyl-coA synthase 


Fatty Acid Metabolism 


MAP3715 




Unknown 




MAP3716C 


fadE6 


Acyl-CoA dehydrogenase 


Fatty Acid Metabolism 


MAP3717C 




Dipeptidyl aminopeptidase 




MAP3718C 




Transcriptional regulator 




MAP3719 




Glyoxalase 


Anti-host killing factor 


MAP3720 




Unknown 




•MAP3721 




polyketide synthase associated protein PapA2 


Cell Adhesion 


•MAP3722 




Unknown 




*MAP3723c 




Transcriptional regulator 




*MAP3724 




Unknown 




*MAP3725 




PPE-1 


Signalling 


*MAP3726 


fepD 


ABC-type cobalamin/Fe3+ siderophore transporter 


Transporter 


*MAP3727 


fepC 


ABC-type cobalamin/Fe3+-siderophore transporter 


Transporter 


•MAP3728 


fepB 


ABC-type cobalamin/Fe3-l — siderophore transporter 


Transporter 


•MAP3729 


tauD 


Taurine catabolism dioxygenase 


Taurine resistance 


*MAP3730 


tehB 


O-methyltransferase 


Tellurite resistance 


*MAP3731c 




ABC non-ribosomal derived peptide transporter 


Transporter 


*MAP3732c 




ABC non-ribosomal derived peptide transporter 


Transporter 


*MAP3733c 




ABC non-ribosomal derived peptide transporter 


Transporter 


*MAP3734c 




ABC non-ribosomal derived peptide transporter 


Transporter 


*MAP3735c 




ABC non-ribosomal derived peptide transporter 


Transporter 



* = MAP specific genes. 



Functional analysis of tellurite resistance 

One MAP specific gene predicted to be deleted in vGI- 
19 was MAP3730 (Table 1), a S-adenosylmethionine 
-dependent methyltransferase with homologues to tellur- 
ite resistance genes (tehB) involved in bacterial virulence 
and persistence [27,28]. The functionality of this gene in 
mycobacteria has not previously been investigated. Using 
a solid culture plate assay we compared tellurite resist- 
ance (MIC) of MAP strains with and without the vGI-19 
deletion (Table 7). This demonstrated a wide MIC range 
(8 - >512 ug/ml) between strains, with significant 
reductions associated with vGI-19 (316FNOR1960) dele- 
tion over full genome complement strains. Of note how- 
ever was the very low level of tellurite resistance (8 ug/ 
ml) found in strains containing the vGI-20 (IIUK2000 & 
2eUK2000) deletion. 

Assessment of virulence using a mouse model 

The virulence of vaccine strains 316FUK2001, IIUK2001 
and 2eUK2001 was compared with wild type strain 
JD87/107 in a mouse model. Ten mice from each of five 
groups (four inoculated with the different MAP strains 
and a negative control group inoculated with PBS) were 
killed at 4, 8 and 12 weeks post inoculation. Body, spleen 



and liver weights were recorded. Samples of the liver 
were taken for bacteriological culture and histopath- 
ology. Mean bodyweights increased with age, but no 
statistically significant difference was observed in mean 
body weight between any of the vaccine strains and the 
control wild type strain at any of the time points 
(p=0.11). There was strong evidence of a difference in 
pattern over time between the mean rank spleen weight 
expressed as a percentage of body weight of the mice 
inoculated with the different MAP strains relative to 
those receiving PBS, where those inoculated with MAP 
had increasingly higher mean rank percentage weights 
(P=0.006). Differences between the MAP strains were 
not formally statistically significant (p=0.06) although 
the control virulent strain JD87/107 showed an increase 
in mean rank spleen weight percentage between weeks 4 
and 8, and 316FUK2001 had an increase between 
weeks 8 and 12. There was no statistical evidence for 
differences in the mean levels of liver weight expressed 
as a percentage of body weight either for different 
strains or over time for any of the MAP strains (p = 0.2). 
However, there was some evidence of a difference be- 
tween the means for the MAP strains and the lower 
mean weights associated with PBS (p = 0.018). 
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Table 2 Deleted region (vGI-20) in vaccine strains IIUK2000 and 2eUK2000 



Gene 



Name 



Putative function 



Process 



MAPI 694 
MAPI 695c 
MAPI 696c 
MAPI 697 
MAPI 698c 
MAPI 699 
MAPI 700c 
MAP1701C 
MAPI 702c 
MAPI 703c 
MAPI 704c 
MAPI 705c 
MAPI 706 
MAPI 707 
MAPI 708 
MAPI 709 
MAP1710 
MAPI 71 1c 
MAP1712 
MAP1713 
MAP1714 
MAPI 715 
MAP1716 
MAPI 71 7 
*MAP1718c 
*MAP1 71 9c 
•MAP1720 
*MAP1721c 
*MAP1722 
*MAP1723 
*MAP 1724c 
*MAP1725c 
*MAP 1726c 
•MAP1727 



papA2 

hsp18_1 

hsp18_2 
thil 



fadDII 2 



fadDE20_ 
fadA_2 
fadB 2 



Polyketide synthase 
Transcriptional regulator 
Heat Shock Protein 
Transcriptional regulator 
Heat Shock Protein 
Thiamine biosynthesis 
|3-lactamase 

Biotin carboxylase subunit 

Thioredoxin 

Unknown 

Glyoxalase 

Transcriptional regulator 
Chitinase-like protein 
3-ketoacyl-(acp) reductase 
Phosphohydrolase-like protein 
LuxE like Acyl-protein synthetase 
3-oxoadipate enol-lactonase 
Transcriptional regulator 
CoA-transferase family III 
Acyl-CoA dehydrogenase 
Acyl-CoA dehydrogenase 
Acyl-CoA dehydrogenase 
Short chain dehydrogenase 
Unknown 

Low identity to MTB immunogen MPT63 
Transcriptional regulator 
Nucleoside-diphosphate-sugar epimerases 
Transcriptional regulator 
IS900 

NADP oxidoreductase coenzyme F420-dependent 

Cytochrome b N 

Catalase 

Transcriptional regulator 
Esterase lipase 



Cell Adhesion 
Stress protein 
Stress protein 



Anti-host killing factor 



Fatty Acid Metabolism 

Fatty Acid Metabolism 
Fatty Acid Metabolism 



Fatty Acid Metabolism 
Fatty Acid Metabolism 
Fatty Acid Metabolism 
Fatty Acid Metabolism 

Cell internalization factor 

Cell envelope biosynthesis 



Anti-host killing factor 
Fatty Acid Metabolism 



: MAP specific genes. 



MAP was recovered from the liver tissue of mice four 
weeks post inoculation in all groups except the control 
group inoculated with PBS. By 12 weeks post infection, 
MAP was recovered from the tissues of only one mouse 
inoculated with vaccine strain 2eUK2001 (mean count 
46 cfu/g), from 6 mice inoculated with IIUK2001 (mean 
counts between 46 and 315 cfu/g) and from all the mice 
inoculated with the virulent JD87/107 strain (mean 
counts 1.4-7 x 10 cfu/g) suggesting attenuation of each 
of the vaccine strains (Figure 2a). Mean rank counts 



increase over time for the JD87/107 strain, while drop- 
ping for all the other MAP strains, this being most rapid 
for the 2eUK2001 strain but ultimately most notable for 
strain 316FUK2001. Statistical assessment of the effect 
of the strain by time interaction on the mean rank count 
indicate that differences exist in the abilities of the MAP 
vaccine strains to survive or persist in mice (p=0.02). 

Infections of the liver result in multifocal hepatitis 
characterised by clusters of inflammatory cells. To meas- 
ure the effect of the experimental infection with MAP, 
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Table 3 Tandem duplicated region (vGI-21) in vaccine strain IIUK2000 


Gene 


Name 


Putative function 


Process 


MAP2705C 




Ketosteroid isomerase 




MAP2706C 




N-acylglucosamine 2-epimerase like 




MAP2707C 




Signal protein 




MAP2708C 




Glutamine amido transferase 


Amino acid-Lipid biosynthesis 


MAP2709C 


tesB2 


Acyl-CoA thioesterase II 


Amino acid-Lipid biosynthesis 


MAP2710C 


pdxS 


Pyridoxal 5'-phosphate synthase 


Amino acid-Lipid biosynthesis 


MAP2711C 




Nudix Hydrolase 26 




MAP2712C 




Phosphatidylinositol alpha-mannosyltransferase 




MAP2713C 




Lauroyl acyltransferase 


Lipid biosynthesis 


MAP2714C 


pgsA 


CDP-alcohol phosphatidyltransferase 


Lipid biosynthesis 


MAP2715C 




Diadenosine tetraphosphate 




MAP2716C 


thrS 


threonyl-tRNA synthetase 




MAP2717 




Na+/H+ antiporter 




MAP2718C 




N-alpha-acyl-glutamine aminoacylase 


Amino acid biosynthesis 


MAP2719C 




Acyl-CoA thioesterase 


Amino acid biosynthesis 


MAP2720C 




Aldo-keto reductase 




MAP2721 




Unknown 




MAP2722C 




2-Nitropropane dioxygenase 




MAP2723C 




Unknown 




MAP2724C 




Unknown 




MAP2725C 




Transcriptional regulator 




MAP2726C 


fdxA 


Ferredoxin 


Fe scavenging 


MAP2727 




S-adenosylmethionine-dependent methyltransferase 




MAP2728 




Transcriptional regulator 




MAP2929C 


pcaB 


Lyase I like 


Benzoate metabolism 


MAP2730C 


pcaG 


Protocatechuate 3,4-dioxygenase 


Benzoate metabolism 


MAP2731C 


pcaH 


Protocatechuate 3,4-dioxygenase 


Benzoate metabolism 


MAP2732C 




Hemerythrin oxygen binding like protein 




MAP2733C 




Rieske (2Fe-2S) protein 





the densities of mononuclear leucocyte clusters were 
counted. Small numbers of leucocyte clusters (< 0.22 
clusters per field) were observed in liver samples from 
mice inoculated with PBS, though no acid fast bacilli 
(AFB) were detected in any of these samples. Large 
differences in the mean ranked density of leucocyte 
clusters between strains were identified (p<0.001) with the 
wild type strain JD87/107 having the highest mean ranked 
densities of clusters (Figure 2b). Strain 2eUK2001 showed 
evidence of higher mean rank densities than the 
316FUK2001 and IIUK2001 strains (p = 0.03). The ranked 
density of leucocyte clusters with AFB showed highly 
statistically significant differences between the means of 
MAP strains (p<0.001), with the JD87/107 strain consist- 
ently showing higher mean densities, with this effect being 
more pronounced from 8 weeks post infection (Figure 2c). 
The vaccine strains all tended to exhibit increasingly lower 



mean ranked densities over the lifetime of the experiment 
(p=0.002), with consistent patterns of differences between 
strains (p=0.008): strain IIUK2001 showed the largest 
mean rank densities, strain 316FUK2001 the lowest, with 
2eUK2001 intermediate. The histopathology results show 
that all strains elicited a similar inflammation at 4 weeks. 
Only thereafter some differences between the inflamma- 
tory responses to the strains became apparent. In addition, 
the analysis of mean bacterial counts and AFB positive 
clusters showed the reduced ability of the vaccine strains 
to survive and persist within mice. Overall, these results 
provide proof of attenuation of the vaccine strains with re- 
spect to a wild type MAP strain. 

Discussion 

In this study, we examined genomic and phenotypic 
characteristics of a panel of MAP vaccine strains obtained 
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Table 4 Tandem duplicated region (vGI-22) present in vaccine strain 316FNLD1978 



Gene 



Name 



Putative function 



Process 



MAPI 750 




Unknown 




MAP1751 


wcaG 


NAD-dependant epimerase/dehydratase 


Cell envelope biosynthesis 


MAPI 752c 




Aldo-keto reductase 




MAPI 753 




Acyltransferase 




MAPI 754c 


uspA 


Adenine nucleotide alpha hydrolase 


Stress Protein 


MAPI 755c 


nirC 


Formate/nitrite transporter 




MAPI 756c 


rshA 


Anti-sigma factor 




MAPI 757c 




Sigma-70 factor 




MAPI 758c 


nrtC 


Acyl-CoA dehydrogenase 




MAPI 759c 


osmC 


Osmotic shock-like protein 




MAPI 760c 




tat translocated iron dependent peroxidise 




MAPI 761c 




Peptidase like lipoprotein 




MAPI 762c 




FTR1; high affinity iron transporter 




MAPI 763c 




Unknown 




MAPI 764 




Unknown 




MAPI 765c 


rssA 


Patatin like esterase 


Fatty acid Metabolism 


MAPI 766c 




Cellobiose ABC transporter 




MAPI 767c 




ABC transporter 




MAPI 768c 




ABC transporter 




MAPI 769c 




Transcriptional regulator 




MAPI 770c 




Sigma-70 factor 




MAP1771c 




\S900 




MAPI 772 




Unknown 




MAPI 773c 




Nuclear transport factor 2 


Steroid metabolism 


MAPI 774c 


pncA 


Isochorismatase 


Nicotinamide metabolism 


MAPI 775 


doxX 


Unknown 




MAPI 776c 




Nucleotide phosphokinase-like 




MAPI 777c 




Unknown 




MAPI 778c 


aroK 


Shikimate kinase 




MAPI 779c 




Phosphotransferase 




MAPI 780c 


lipT 


Esterasejipase 


Fatty acid Metabolism 


MAPI 781 


Ippl 


Lipoprotein 




MAPI 782c 




p450 cytochrome 




MAPI 783 




Transcriptional regulator 




MAPI 784c 




Lipoprotein 




MAPI 785 




15900 




MAPI 786c 




Dihydropicolinate reductase-like 




MAPI 787c 




Short chain dehydrogenase 




MAPI 788 




Transcriptional regulator 




MAPI 789 




Unknown 
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Table 5 Duplicated region (vGI-lb) in vaccine strain 316FUK2000 



Gene 


Name 


Putative function 


Process 


MAP0095C 


nuol_1 


NADH dehydrogenase subunit 1 




MAP0096C 




Membrane fatty acid desaturase 


Fatty acid metabolism 


MAP0097c 




Membrane fatty acid desaturase 


Fatty acid metabolism 


MAP0098C 




Transcription regulator 


expression regulator 


MAP0099 


csbD 


Stress protein 


stress response 


MAP0100 


gcs2 


Glutamate-cysteine ligase 


glutathione biosynthesis 


MAP0101 


gcs2 


Glutamate-cysteine ligase 


glutathione biosynthesis 


MAP0102 




Unknown 




MAP0103C 




Unknown 




MAP0104 


IS 1 3 1 1 


Transposase 


DNA mutator transposon 



from several laboratories around the world including both 
low and high passage examples of the 316 F lineage. Using 
a mouse model, we assessed the virulence ofrepresentative 
clades of three vaccine strains (2e, II, 316 F) with respect 
to a virulent MAP clinical isolate. The vaccine strains were 
clearly attenuated with regard to their ability to survive 
and persist in the mice as evidenced from the reduced 
numbers of MAP recovered and reduced numbers of 
leucocyte clusters containing AFB in the livers. This 
supports previous studies showing decreased persistence 
of the same 316 F and 2e strains in calves after 8 months 
[29] and illustrates the utility of the C57BL/6 mouse 
model for virulence studies. 

Using a pan-genomic MAP/MAH microarray we 
demonstrated that the genomes of all but one of the 
316 F strains in the test panel contain the same full gen- 
ome complement as the reference virulent bovine MAP 
type II strain MAPK10. One 316 F strain obtained from 
Norway (316FNOR1960) contained a single deleted re- 
gion (vGI-19) spanning 21 ORF's (including 10 MAP 
specific genes). Two strains not of the 316 F lineage 
(2eUK2000 and IIUK2000) contained a different deleted 
region (vGI-20), identical in both strains, spanning 34 
ORF's (including 10 MAP specific genes). Specific PCR 
for MAP1723 (Table 6), located within the vGI-20, on 
2eUK2001 and IIUK2001 were negative confirming that 
the vGI-20 region was not present in these strains 
(Additional File 1). The inclusion of MAP-specific genes 
in these deleted regions is an important observation as 
these genes could provide the basis for differentiating 
infected from vaccinated animals (DIVA). Indeed, the 
deleted region vGI-19 contains part of the 38 kb patho- 
genicity island described by Stratmann et al. (2004) [30] 
which contains genes encoding a number of antigens 
with diagnostic potential [31]. Both deleted regions 
vGI-19 and vGI-20 contain genes potentially involved in 
virulence and pathogenesis (Table 1) and their deletion 
could therefore have a profound effect on the virulence 
of these strains. In this study we demonstrated using a 



mouse model that both vaccine strains 2eUK2001 and 
IIUK2001 were attenuated with respect to a wild type 
MAP strain. In addition, vaccine strain IIUK2000 and 
IIUK2001 were found to contain a large 41 ORF tandem 
duplication (vGI-21) which includes copies of benzoate 
and lipid metabolic pathways. Vaccine strain 2eUK2000 
comes from the same stock as 2eUK2001 and was 
maintained at the VLA, UK for over 50 years on a min- 
eral deficient medium (Watson Reid A' block) whilst 
vaccine strain IIUK2000 was not. We suggest that the 
vGI-21 duplication in vaccine strain IIUK2000 was 
selected by these differences in media and fixed into the 
genome to compensate in vitro for the deletion of lipid 
biosynthesis and carbon usage repertoires, removed by 
the vGI-20 deletion. 

The large deletion vGI-19 present in vaccine strain 
316FNOR1960 was not present in any of the other 316 - 
F strains including an early low passage lineage 
(316FCYP1966) and a more recent isolate (316 F2001) 
shown to be significantly attenuated in our virulence 
studies. Notably, part of vGI-19 is also present in the 
same gene order within the related MAH104 genome 
(GenBank reference CP000479). Together these suggest 
that any ancestral precursor and therefore the original 
316 F strain would be unlikely to be missing vGI-19. We 
hypothesise that the vGI-19 deletion appeared in the 
316FNOR1960 strain some point after its acquisition 
and transfer to Norway in 1960 from the VLA, UK. This 
strain is recorded as having been maintained, uniquely 
on Dubos medium with added pyruvate [15] and we hy- 
pothesise that this medium was at some point selective. 
This is supported by the vGI-19 deletion in this strain 
including gene homologues of glyoxylate enzymes 
associated with pyruvate metabolism [32]. This strain 
previously has been used successfully as a live vaccine 
suggesting that it is attenuated. The knockout of the 
glyoxylate shunt could significantly affect the strain's abil- 
ity to control anaerobic respiration [33] and intracellular 
persistence [34], which may indicate that attenuation in 
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Table 6 PCR primer sets used in this study Table 6 PCR primer sets used in this study (Continued) 



Gene set 




Sequence 


IS900(MAP1722) 




vGI-1 b: (internal gene primer sets) 




MAPI 720-MAP1 721 c.F 


CTAAGGTGGCCAGCGTTTCT 


MAP0101.F 




GGTTACCGACTTGGTCCAGA 


MAP1723.R 


GGTTGGAGACAACCTCGTTC 


MAP0101.R 




CCCGTCAGATCCATTACGAC 


IS900(MAP 1771c) 




MAP0103.F2 




GAGCGCCAACCTATCTCAAC 


MAP1770c.F 


ACAATTCGGCGATCGTCTCG 


MAP0103.R 




TGGTTTGGAGATCCCTGAAG 


MAP1772.R 


CGCGGACAGACACAGGTAGG 


vGI-19 (internal gene primer set) 




IS900(MAP1785) 




MAP3733c.F 




TTCATGTTGTTCCTCACCCG 


MAP1784c.F 


GTCGACCATCGCTCTTCCCT 


MAP3733C.R 




CATCGCGAGCGTAGCCGCTG 


MAP1786C.R 


ATCGGTGTCGAGGACATFCC 


vGI-20 (internal gene primer set) 




IS900(MAP 1793c) 




MAP1723.F 




GGAGTATGGAACCATCGCTG 


MAP1792.F 


CAATGGATGGTCGTCACCTG 


MAP1723.R 




TGATGTAACGGGCGTGCAAG 


MAP1794.R 


CGGCCCGCTTGATCCATTTG 


vGI-21 (specific for tandem 


duplicated region in IIUK2000/1) 


IS900(MAP2034c) 




MAP2705.SEQ2 




CGATGAAGGCCTACTGGTC 


MAP2033-MAP2034C.F 


CCGCAAGTAGTGCACTATGG 


MAP2733.R 




TCAACTGGCTCCTCCTTTTG 


MAP2035.R 


TATTCGGGGTTGTTCAGGGA 


vGI-22 (specific for tandem 


duplicated region in 316FNLD1978) 


IS900(MAP2108) 




MAP1789.F 




TTTGGCACTACAACGAGCTG 


MAP2105.F 


CAGGCACGGAACACAGTTCG 


MAP1750.R 




GATCGAGAAGAGGGGAGTCA 


MAP2109C.R 


TGTTCGGCTACGGCATACTG 


MAP21 14c (single copy gene for control PCR) 


IS9O0(MAP2157) 




MAP21 14c.F 




TGGAAGCTACCCAAATCACC 


MAP2156.F 


CTGCAAACACAGCCCAATC 


MAP2114c.R 




GAGAAGTAGGCCGCGTAGTC 


MAP2158.R 


CAACTTCGGCAAGTTCACC 


5900 






IS900(MAP2203c) 




AV-1 




ATGTGGTTGCTGTGTTGGATGG 


MAP2202c.F 


TCCCGGTAGAAGATCATGTG 


AV-2 




CCGCCGCAATCAACTCCAG 


MAP2204c.R 


GACAATCTGCCGTCGTATCA 


pre16SrRNA 






IS900(MAP2444c) 




pre16SrRNA.R 




GCGCAGCGAGGTGAATTT 


MAP2443.F 


AACCTTGACCCACACCTTCC 


pre16SrRNA.F 




TTTGGCCATACCTAGCACTCC 


MAP2445.R 


TGAGCTCGCCGGCGAAATA 


Tellurite (tehB) 






IS9O0(MAP2577) 




MAP3730.F 




GGTTTGACGACAGAGATGCG 


MAP2567C.F 


CGTTCTGGGCATCCATCGACG 


MAP3730.R 




GGCATGGTGTACGACAAGGA 


MAP2578.R 


TCACGGCGGTCAGGTTACTTC 


MIRU3 (MAP3981-2) 






IS900(MAP3480) 




MIRU3.F 




ACATTCACCCTGTCCATTCC 


MAP3479C.F 


GTTGAACTTTCCGACCAACC 


MIRU3.R 




CCTCCTTACGGAGCAGGAA 


MAP3480-3481.R 


GGTTAGCGGGAGAAAAGCTC 


IS900(MAP0034) 






IS900(MAP4066) 




MAP0033c.F 




AGCTGCCGGGAGTTGATCT 


MAP4065.F 


TGGGCCTGAGGTCAGAACCA 


MAP0035.R 




TCACGGCTCACCGCACGCT 


MAP4067C.R 


GAAGACCACCTCTACCTCAC 


IS900(MAP0159c) 






IS900(MAP4281) 




MAP0158.F 




CCCGTGACAAGGCCGAAGA 


MAP4280.F 


GCTGACCGAGAAGGGCTAC 


MAP0160.R 




CGI I I IGCACCTCGATGGCC 


MAP4282.R 


CGTAAGTGACTGGCTCATGG 


IS900(MAP0967) 






MAP gene annotation corresponds to GenBank AE016958. 


MAP0966c-MAP0967.F 




CGTGACGAACGACATGTGTT 






MAP0967-MAP0968.R 




GCTCGAGACATTTAGCCCAC 


this strain may be related to this loss. A 2002 isolate of this 


IS900(MAP1033) 






strain was shown to be highly immunogenic in goats [35] 


MAP1032c.F 




GTAGAACCCGACCAGCAGC 


and forms part of a 


316 F/2e live commercial vaccine 


MAP1034c.R 




GACCTGGCACCTGTATCCC 


product (Paratuberkulose-vaksine, National Veterinaria 
Institute, Oslo, Norway) however there is no evidence that 
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Table 7 MIRU, vGI, Tellurite MIC and \S900 I MAP0101 / MAP0103 genome copy numbers 



Strain 


MIRU3 
bD (codv no) 


vGIs* 


Tellurite MIC 
ug/ml 


Copy number relative to MAP21 14c 
\S900 MAPOIOI/MAP0703 


316FNLD2008 


244 (1) 




128 


17 


1 


316FUK2001 


244 (1) 




nd 


nd 


nd 


316FNLD1978 


244 (1) 


22 dp 


128 


19 


1 


316FUK2000 


297 (3) 


1b dp 


128 


16 




316FNE04/81 


297 (3) 




nd 


17 


1 


316FNE08/81 


297 (3) 




nd 


17 


1 


316FNE068451-2 


297 (3) 




nd 


17 




316FNE069341 


297 (3) 




nd 


17 




31 6v 


297 (3) 




nd 


16 




316FCYP1966 


297 (3) 




> 512 


16 




316FNOR1960 


297 (3) 


19 dl 


64 


16 




IIUK2000 


297 (3) 


20 di &21 d P 


8 


13 




2eUK2000 


297 (3) 


20 dl 


8 


13 




MAPK10 


297 (3) 




128 


17 




CAM87 


297 (3) 




256 


17 




nd = not done. 



t vGI status for vGI-lb, vGI-19 - vGI-22 either duplicated (dp) or deleted (dl), else no entry designates present as a single copy region. 



vGI-19 was deleted when the strain was used, albeit very 
effectively, on goats in the 1960-70's [15]. 

We tested the potential impact provided by deletion of 
the putative tellurite resistance gene (tehB) included in 
vGI-19 on 316FNOR1960 phenotype. Tellurite is highly 
toxic to bacteria due to its action on DNA synthesis. It 
is an important mechanism by which animals combat 
intracellular microorganisms [27] and was used in early 
studies as a tuberculosis/leprosy therapeutic [36]. Bacter- 
ial resistance to tellurite is inducible, is associated with 
virulence [28] and is linked to catalases which are 
required to process the superoxide anions generated as a 
result of bacterial metabolic mechanisms used to inacti- 
vate tellurite. We show a significantly increased sensitiv- 
ity to tellurite in 316FNOR1960 whilst other 316 F 
strains either matched or exceeded the resistance of the 
two wildtype strains tested (K10:bovine, CAM87:cap- 
rine). Interestingly the strains most sensitive to tellurite 
were IIUK2000 and 2eUK2000 which lack the tehB gene. 
The metabolism of tellurite generates high reactive oxy- 
gen species which subsequently need to be de-toxified 
by catalase [37]. Significantly the vGI-20 deletion 
in these strains includes loss of the catalase gene 
homologue MAP1725c. Both vaccine deletion regions 
thus involve alterations in metabolic pathways associated 
with deactivation of high reactive oxygen species tox- 
icity, which suggests this may be an important mechan- 
ism underlying attenuation in these strains. 

Several of the other vaccine strains tested are also 
reported to have been maintained on markedly different 
media which may have similarly promoted or selected 



for genomic and phenotypic diversities. 316FNLD1978, 
available as a heat killed vaccination for dairy cattle since 
1985 [38], was found to contain a large tandem duplica- 
tion (vGT22) unique to this strain. It is notable that this 
isolate was selectively subcultured on potato starch 
medium to enhance its growth (P. Willemsen personal 
communication) and now grows with difficulty on other 
media. It is tempting to speculate that the acquisition of 
extra copies of 14 ORFs including cell wall, fatty acid bio- 
synthesis genes and two extra copies of IS900 are a direct 
result of the selective process performed on this strain. 

We demonstrated in this study that vaccine strain 
316FUK2001 was clearly attenuated with respect to wild 
type MAP strain JD87/107. The vGI-19 deletion found 
in 316FNOR1960 and the vGI-20 deletion found in 
2eUK2000 and IIUK2000 were not detected by PCR in this 
strain suggesting that attenuation in this strain is due to dif- 
ferent genetic polymorphisms. A duplicated region (vGI- 
lb) was detected in vaccine strain 316FUK2000, which may 
possibly have arisen as an adaptation to growth on liquid 
Watson Reid media. 

Insertion sequences bordering specific genomic regions 
have previously been associated with variably tandem 
duplicated [39] and possibly large genomic inversions [40] 
and contribute highly to the degree of plasticity in the 
MAP genome. Variations in copy number of insertion 
elements including IS900, IS1311, IS256 and ISJ652-like 
elements were seen between vaccine strains and virulent 
isolates. An IS1311 was found immediately bordering the 
vGI-lb region duplicated in 316 F-UK2000 but not other 
316 F strains. Similar genomic variations including vGI-lb 
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(See figure on previous page.) 

Figure 2 Virulence assessment of vaccine (2eUK2001, 316FUK2001, IIUK2001) and wild type (JD87/107) MAP strains in a mouse model. 

A. Quartile-Based Box and Whisker plots of bacterial load (CFU/g) in the liver at 4, 8 and 12 weeks post-infection. B. Quartile-Based Box and 

Whisker plots of mean ranked density of leucocyte clusters in the liver at 4, 8 and 12 weeks post-infection. C. Quartile-Based Box and Whisker 

plots of mean ranked density of leucocyte clusters with AFB in the liver at 4, 8 and 12 weeks post-infection. * indicates an unusually large or 

small observation (outlier). Values beyond the whiskers are outliers. The top of the box is the third quartile -75% of the data values are less than 

or equal to this value. The bottom of the box is the first quartile -25% of the data values are less than or equal to this value. The median is 

shown within the box. The whiskers extend to the highest and lowest data values which have not been identified as outliers. 

RMfimn 



have been observed in virulent MAP strains [26] . IS900, a 
definitive element of MAP found in all clinical and vaccine 
strains, was also shown to be present in a variety of copy 
numbers. This work used comparative ratios of qPCR 
signals to estimate the average number of IS900 copies per 
cell per culture relative to two single copy MAP genes 
using an assumption determined from a MAP assembled 
genome sequence that MAPK10 would contain 17 copies. 
Our results confirm previous studies showing the vaccine 
strain 316v used in Australia for ELISA testing [41] 
contains one less genomic copy of IS900 [42] than most 
other 316 F strains [25]. Vaccine strain 316FNLD1978 
exhibited higher gene signal ratios consistent with the two 
extra copies of IS900 copies inside the duplication of vGI- 
22. Vaccine strains IIUK2000 and 2eUK2000 contained 
lower signal ratios consistent with loss of an IS900 copy 
inside the deletion region vGI-20. Consistently however 
the calculated IS900 copy number in these strains was 
much lower than expected using the ratio method. Using 
site specific PCR we confirmed 16 IS900 filled insertion 
sites in the genomes of these strains whereas the ratio 
method, using MAPK10 as a standard, predicted only 13 
copies. The reason could be technical, perhaps involving 
incomplete bacterial lysis of these unusual strains, how- 
ever IS900 is known to replicate in episomal minicircles 
[43] and when all consensus insertion sites are filled they 
may exist as extra genomic components awaiting trans- 
position. If this is indeed the case, virulent MAP strains 
would have the capacity to contain more than the 
predicted 17 IS900 copies per cell. This could be an im- 
portant factor in studies relying on qPCR to determine ac- 
curate estimates of MAP load [44]. 

MIRU3 is a short tandem repeat sequence located within 
the sensX3-regX3 two component signalling system that 
controls carbon source usage and mechanisms reducing 
damaging reactive oxygen species generated by aerobic me- 
tabolism [45]. The attenuated BCG vaccine characteristic- 
ally contains a low MIRU3 tandem repeat copy number 
which has been suggested to be involved in the control of 
sensX3-regX3 expression [46]. In this study 316 F strains 
(316FNLD1978, 316FUK2001, 316FNLD2008) had low 
MIRU3 copy numbers whilst others, mostly originating 
from older culture stocks, were larger. This may represent 
a potential for genomic drift within recent 316 F lineages 
and could be used as a marker for the most current 316 F 



VLA, UK strain but may also be suggestive that loss of 
intracellular persistence in full genome complement 316 F 
strains may be associated with reduced transcriptomic sig- 
nalling capacity. It will be of interest therefore in future 
total genome sequencing studies to compare dysfunctional 
SNP variations within signalling features of 316 F strain 
genomes. 

Conclusions 

This study has shown that significant genomic diversity 
exists between MAP vaccine strains and within the 
316 F lineage. These include large deletions, duplications 
and changes in insertion sequence copies. These 
mutations were probably derived in a classical manner 
by selective subculture on laboratory media and in some 
cases have led to significant alterations of phenotype and 
attenuation. There were 25 MAP specific gene deletions 
identified of which at least one could be linked to 
phenotypic change that would disadvantage its persist- 
ence in the host and thus associates it with virulence. 
Furthermore, these MAP-specific gene deletions could 
provide the basis for a DIVA diagnostic for use with 
these vaccines. Overall, this work illustrates that MAP 
genome plasticity can be influenced by in vitro culture 
over long periods and a robust definition of vaccine 
strain genome lineage will be necessary in the future to 
guarantee consistency between studies. 

Methods 

Strains and culture media 

MAP strains used in this work, their origins, sources 
and media used for propagation are described in 
Table 8. 

DNA extraction 

DNA was extracted for typing and arrays using a previ- 
ously described protocol [26]. Briefly, 1x10 cells of 
cultures grown on liquid Middlebrook 7H9 medium for 
up to 12 weeks were pelleted, washed once in lx PBS, 
then resuspended in 650 ul mycobacterial lysis buffer 
(0.5 M EDTA -pH 8.0-, 5 M NaCl, 1 M TrisHCl, 10% 
SDS and 8.6 ml H 2 0). Colonies were emulsified by passing 
10 times through a 25 G needle, and treated overnight at 
37°C with 5 ul lysozyme (Stock 100 mg/ml: Sigma), 10 ul 
proteinase K (Stock 10 mg/ml: Sigma) and 4 ul lipase 
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Table 8 Details of MAP strains used in this study 



Name 


Origin and source 


Medium used for maintenance and 
propagation 


316FNOR 1960 
(Vaccine strain) 


Obtained from the VLA in 1960 and used in a vaccine trial in goats in Norway 
during the 1960s [15]. Maintained at the Norwegian Veterinary Institute, Oslo. 


Selective Dubos medium [47] supplemented 
with mycobactin (2 ug/ml) and pyruvate 
(4 mg/ml) 


316FCYP1966 
(Vaccine strain) 


Obtained from the VLA in 1966 as lyophilised aliquots and used to vaccinate 
goats in Cyprus during the 1960s [18]. Strain used in this study was recovered 
from an aliquot lyophilised on 04 January 1966 and resuscitated in 2009 with limited 
passage since. 


7H9* 


316FNLD1978 
(Vaccine strain) 


Obtained from the VLA in 1978 and used as a killed vaccine [38], 

hfl'j-" J J- 4-1 /" 4_|\/4_- 1 J-'-L 4_ 1 1 . 1 K \ j_|„ 1 -J 

Maintained at the Central Veterinary Institute, Lelystad, Netherlands. 


Potato starch medium (P.Willemsen 
personal communication) 


316FNE04/81 
(Vaccine strain) 


Neoparasec vaccine (Merial, France) subcultures from a stock [25] assumed to be 
derived from a 316 F Weybridge UK strain purchased in the 1980s. 


7H9*or 7H11** 


316FNE08/81 
(Vaccine strain) 






316FNE068451-2 
(Vaccine strain) 






316FNE069341 
(Vaccine strain) 






316v 


Australian strain derived from a variant labelled 31 6f around 1986 [48] which itself 
was obtained from a New Zealand source who obtained the strain in the early 1980s. 
Maintained at the University of Sydney, Sydney, Australia. 


Mycobactin free medium but grew poorly 
(R Whittington, personal communication) 


316FUK2000 
(Vaccine strain) 


Obtained from the VLA in 2000 and maintained at St George's Hospital Medical 
School, London, UK, with limited passage. 


Liquid Watson Reid f 


316FUK2001 
(Vaccine strain) 


Obtained as a lyophilised stock from the VLA in 2001 and maintained at the 
Moredun Research Institute, Scotland, UK. 


7H11** 


316FNLD2008 
(Vaccine strain) 


Obtained from VLA in 2008 and maintained at the Central Veterinary Institute, 

i l j_ l h i j_ l l l 

Lelystad, Netherlands 


HEYM 


IIUK2000 
(Vaccine strain) 


Obtained from the VLA in 2000 and maintained at St George's Hospital Medical 
School, London, UK. 


Liquid Watson Reid + [14] 


IIUK2001 
(Vaccine strain) 


Obtained as a lyophilised stock from the VLA in 2001 and maintained at the 
Moredun Research Institute, Scotland, UK. 


7H11** 


2eUK2000 
(Vaccine strain) 


Obtained from the VLA in 2000 and maintained at St George's Hospital Medical 
School, London, UK. 


Liquid Watson Reid 'A' Block ++ medium 


2eUK2001 
(Vaccine strain) 


Obtained as a lyophilised stock from the VLA in 2001 and maintained at the 
Moredun Research Institute, Scotland, UK. 


7H11" 


MAPK10 (Wild 
type strain) 


Purchased from ATCC: BAA-968. Sequenced reference strain isolated from a 
cow in 1990. 


7H9*or 7H11** 


CAM87 (Wild 
type strain) 


MAP Type III strain isolated from a goat in 2005 [26] and maintained at 
the Universidad Complutense de Madrid, Madrid, Spain. 


7H9* 


JD87/107 (Wild 
type strain) 


Isolated from a deer in 1987 and maintained at the Moredun Research Institute, 
Scotland, UK. 


7H11** 



*7H9: Middlebrooks 7H9 (Becton Dickinson, UK) supplemented with 2 mg/L Mycobactin J (Allied Monitor, USA), 0.5% glycerol, 10% oleic acid-albumin-dextrose- 
catalase (OADC) enrichment medium (Difco, UK), 25 mg/L amphotericin B, 35 mg/L naladixic acid and 35 mg/L vancomycin. 

**7H11: Middlebrooks 7H11 (Becton Dickinson, UK) agar supplemented with 2 mg/L Mycobactin J (Allied Monitor, USA), 2.5% glycerol (v/v), 10% OADC (v/v) 
enrichment medium (Difco, UK), 20% (v/v) new born calf serum, 0.3 g/L asparagine, Mycobacteria Selectatabs (10 mg/L amphotericin B, 200,000 units/L polymixin 
B, 100 mg/L ticarcillin and 10 mg/L trimethoprim [MAST Laboratories Ltd, UK]). 

f Liquid Watson Reid: Asparagine 5 g/l, Potassium dihydrogen phosphate 2 g/l, Magnesium suphate 1 g/l, Tri-ammonium citrate 2 g/l, Sodium chloride 2 g/l, D(+) 
Glucose 10 g/l, Glycerol 48 ml/1, Ammonium iron (III) citrate brown 0.075 g/l, 1.33mls of Supplement A: 2 g/l Zinc sulphate, 2 g/l Copper sulphate, 1 g/l Cobalt 
nitrate, 1.33mls of Supplement B; 50 g/l Calcium chloride. 

n Liquid Watson Reid 'A' Block: as Watson Reid medium but without supplements A and B. 



(Stock 121 mg/ml Sigma). Samples were ribolysed (Lysing 
Matrix B; Qbiogene) at 6,500 rpm for 50 sec, iced for 
10 min then 400 ul of lysate added to 400 ul of Qiagen 
DNAeasy AL lysis buffer, mixed and applied to a DNAeasy 
column. 200 ul of 100% ethanol was added and columns 
centrifuged at 8,000 rpm for 1 min, washed in 500 ul 



Qiagen Lysis buffer 1 and 2, then eluted in 90 ul DNA/ 
RNAse free H 2 0 overnight on the column at 4°C. 

MIRU3 typing and \S900 locus PCR 

Five microlitres of MAP DNA extracted from test strains 
was amplified for MIRU [49] or IS 900 [40] as previously 
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described using 2 uM primers MIRU3.F& MIRU3.R 
spanning the MAP3982-MAP3983 locus or with IS900 
locus specific primers designed to amplify across the 
complete IS900 insertion from immediately adjacent loci 
(Table 6). All PCR reactions used lx Expand reaction buf- 
fer containing 1.5 mM MgCl 2 , 10% DMSO, 100 uM dNTP 
and 1 unit Expand High Fidelity Taq polymerase (Roche). 
Cycling conditions were: 95°C: 3 min: 1 cycle; 94°C: 30 sec : 
60°C: 30 sec : 72°C : 1 min : 35 cycles; 72°C : 5 min : 1 cycle. 
Confirmation of amplicon product size in bp was made on 
1.8% agarose gels. 

MAPAC microarray hybridisation and analysis 

DNA from the test strain and reference MAP K-10 strain 
were fluorescently labelled and hybridised to the micro- 
array using protocols described previously [50]. Briefly, 
1 ug of DNA was labelled by random priming with 
Klenow polymerase to incorporate either Cy3 or Cy5 
dCTP (GE Healthcare) for the test strain and reference 
strain respectively. Equal amounts of the Cy3 and Cy5 la- 
belled samples were co-purified through a Qiagen 
MinElute column (Qiagen), mixed with a formamide- 
based hybridisation solution (lxMES, 1 M NaCl, 20% 
formamide, 0.02 M EDTA, 1% Triton) and denatured at 
95°C for 2 min. The labelled sample was loaded on to a 
prehybridised (3.5xSSC, 0.1% SDS, 10 mg/ml BSA) micro- 
array under two 22x22 mm LifterSlips (Erie Scientific), 
sealed in a humidified hybridisation cassette (Corning) 
and hybridised overnight by immersion in a waterbath at 
55°C for 16-20 h. Slides were washed once in 400 ml 
lxSSC 0.06% SDS at 55°C for 2 min and twice in 400 ml 
0.06xSSC for 2 min. Microarrays were scanned using 
an Affymetrix 428 scanner, and signal intensity data 
were extracted using BlueFuse for Microarrays v3.5 
(BlueGnome). The intensity data was further post- 
processed using BlueFuse to exclude both controls and 
low confidence data (p<0.1) prior to normalisation by 2D 
Lowess (window size=20) and median centring. Further 
analysis of the normalised data was undertaken using 
BlueFuse, GeneSpring 7.3.1 (Agilent Technologies) and 
Eisen Cluster [51]. Fully annotated microarray data 
has been deposited in BuG@Sbase (accession number: E- 
BUGS-264; http://bugs.sgul.ac.uk/E-BUGS-264) and also 
ArrayExpress (accession number: E-BUGS-264). 

qPCR 

Relative copy number of amplicons were generated 
from IS900, MAP0101 and MAP0103 primer sets and 
compared against absolute amplicon numbers obtained 
from a MAP2114c primer set (Table 6). Reactions 
comprised 2 ul genomic DNA sample, 12.5 ul Power SYBR 
green mastermix (Applied Biosystems, Cat 4368706), 2 



pMoles appropriate primer pairs, made to 25 ul with 
RNAse free H20. PCR cycling used 95°C:15mins (1 cycle); 
at 95°C:30secs, 58°C :1 min, 72°C:1 min (40 cycles) with 
data collection at 76°C (lOsecs) using a CFX96 qPCR cycler 
(BioRad, UK). Sample copy numbers were estimated from 
an averaged value of three qPCR's on each sample using a 
dilution curve of a control stock total genomic DNA MAP 
K-10 preparation serially diluted 10 fold to contain between 
1 x 10 2 -10 6 genome copies. 



Tellurite MIC 

Cultures of MAP strains were grown in conventional li- 
quid media to exponential phase for 6 weeks then adjusted 
to 10 4 cfu/ml using OD550. Aliquots (10 ul) were 
inoculated onto solid RAF medium in petri dishes 
containing serial dilution of potassium tellurite to final 
concentrations of 512, 256, 128, 64, 32, 16, 8, 4 or 0 ug/ml 
and incubated at 37°C. MIC's were taken as the least tel- 
lurite concentration able to inhibit >90% growth, seen as 
black colonies, after 6 weeks of growth and 12 weeks 
growth for strain IIUK2000 which was slower to grow 
in vitro. 



Assessment of virulence using a mouse model 

The virulence of vaccine strains 316FUK2001, IIUK2001 
and 2eUK2001 was compared with wild type strain JD87/ 
107 in a mouse model. 316FUK2001, IIUK2001 and 
2eUK2001 were selected to represent the three different 
vaccine strains that have been used for control of JD over 
the years. JD87/107 was selected as the control strain as 
this was the virulent wild type strain that was used previ- 
ously in our laboratory to optimise the mouse model and 
PBS was used as a negative control. C57BL/6 mice of ap- 
proximately five weeks old and between 20 and 25 g in 
weight were purchased from Harlan UK, Shaws Farm, 
Blackthorn, Bicester, Oxon OX25 1TP. The mice were indi- 
vidually weighed and randomly assigned to five groups of 
30. One negative control group was inoculated with 0.1 ml 
of sterile PBS. The remaining groups were inoculated 
intraperitoneally with 1.1 to 1.4 x 10 organisms in 0.1 ml 
PBS of one of the MAP strains 2eUK2001, IIUK2001, 
316FUK2001 and JD87/107. The inocula were prepared as 
previously described [52] and enumerated by performing a 
microscopic count. Ten mice from each group were killed 
at 4, 8 and 12 weeks post inoculation by exposure to a mix- 
ture of carbon dioxide and halothane gas followed by cer- 
vical dislocation. Each mouse was weighed and the body 
weight recorded. The spleens and livers were removed 
aseptically and weighed. The respective weights were 
expressed as a percentage of body weight for each 
mouse. Approximately 0.1 g of liver was removed for 
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bacteriological culture and the remaining tissue fixed in 
10% formal saline for histopathological examination. 

Hepatic bacterial load was assessed by using serial 
dilutions of a 10% (wt/wt) liver homogenate in PBS. 0.1 ml 
of each dilution was inoculated onto 7H11 agar with 
supplements as detailed in Table 8 and incubated at 37°C 
for up to 16 weeks. Colony counts for each animal repli- 
cate were estimated by fitting a generalised linear model 
to the dilution assay counts assuming an overdispersed 
Poisson response and a logarithmic link function while fit- 
ting the logarithm of the dilution as an offset variable to 
the fixed mean. It was assumed that observations greater 
than 200 CFU per field could not be quantified accurately, 
and such observations were included in the likelihood as 
taking unknown values greater than this threshold. 

The fixed liver samples were given a random code to 
assure that the samples were assessed blind by the path- 
ologist. The samples were processed to paraffin wax and 
5 um sections prepared. The sections were stained with 
haematoxylin and eosin (H&E) and Ziehl-Neelsen (ZN) 
method . Using an Olympus BX50 microscope, the num- 
ber of leucocyte clusters was counted in approximately 
100 fields at x200 magnification from each individual 
animal using an eyepiece graticule (Pyser-SGI Ltd, 
NE35-24 mm) and the counts normalised to 1 field. A 
leucocyte cluster was defined as an accumulation of 
more than 10 mononuclear leucocytes. The infectious 
load of each animal was assessed by counting leucocyte 
clusters containing AFB. Because the detection of AFB 
requires higher magnification (x400-600), the number of 
leucocyte clusters with AFB was assessed separately. De- 
pending on the original leucocyte cluster density, up to 
sixty leucocyte clusters were assessed in detail and the 
proportion of leucocyte clusters containing AFB 
determined. Based on the leucocyte cluster counts and 
the proportion of leucocyte clusters containing AFB, the 
infectious load was expressed as the mean number of 
AFB positive leucocyte clusters per field. 

All data were analysed by fitting a linear mixed model to 
either the data as specified above or to the ranks of these 
data, with this choice being made on the basis of the nor- 
mality of residuals in the model fitted to the original data. 
The mixed model approach was preferred to traditional 
ANOVA to better allow for replicates missing at random 
from the sample. Strain and Week and interactions were 
fitted as fixed effects, animal replicate as a random re- 
sidual effect. Statistical analysis was carried out using 
Genstat version 14 and using user-defined macros in Excel 
2007. All statistical analysis and derivations of P values are 
provided in Additional File 2). 

Ethical considerations 

All experimental procedures and management protocols 
were examined and approved by the Moredun Research 



Institute Experiments and Ethics Committee and conducted 
within the framework of the UK Animals (Scientific 
Procedures) Act 1986' administered by the Home Office of 
the UK government. 
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Additional file 2: Mouse Model Data File. Tables and statistical 
analyses of virulence experiments in mice. 
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